Improved Binaural Model for Localization of Multiple Sources
نویسندگان
چکیده
An improved binaural hearing model is proposed that consists of a physiologically motivated signal processing step and a subsequent cognitive model. The model is shown to be capable of correctly detecting and tracking sources while blindly determining the number of active sources based on temporal and spectral information. The complete model is of interest in all areas that need to consider the capabilities of the human hearing system while the ability to determine the number of active sources makes it a logical enhancement for source separation and spatial signal processing such as adaptive beamforming.
منابع مشابه
Robust Localization of Multiple Speech Sources Based on Time Difference of Arrival in Real Environments for Binaural Robot Audition
This paper presents a multisource speech localization method based on the generalized cross-correlation (GCC) method weighted by the phase transform (PHAT) for binaural robot audition. The direction-of-arrival (DOA) estimation based on the GCC-PHAT method was extended to enable simultaneous multiple DOA estimations with a signal-to-noise ratio (SNR)-based weighting function. The standard K-mean...
متن کاملSpatial Hearing Algorithms Based on Binaural Zero-Crossings: Sound Source Localization, Segregation, and Dereverberation
This thesis concerns a new zero-crossing-based binaural model for spatial hearing. Conventional binaural model computes cross-correlations of binaural signals for the estimation of the interaural time difference which is a primary spatial cue. However, the cross-correlationbased binaural processing model requires high computational complexity and suffers from inaccuracies in localizing sound so...
متن کاملImprovement of Sound Source Localization for a Binaural Robot of Spherical Head with Pinnae
II diffraction problem was overcome by incorporating a new time delay factor into the GCC-PHAT method under the assumption of a spherical robot head. The ambiguity problem was overcome by utilizing the amplification effect of the pinnae. Finally the difficulties with multisource sound localization in real environments were addressed by extending the proposed ML-based SSL method using the new ti...
متن کاملA computational model of binaural localization and separation
Multiple sound signals, such as speech and interfering noises, can be fairly well separated, localized, and interpreted by human listeners with normal binaural hearing. The computational model presented here, based on earlier cochlear modeling work, is a first step at approaching human levels of performance on the localization and separation tasks. This combination of cochlear and binaural mode...
متن کاملAn Active Machine Hearing System for Auditory Stream Segregation
This study describes a binaural machine hearing system that is capable of performing auditory stream segregation in scenarios where multiple sound sources are present. The process of stream segregation refers to the capability of human listeners to group acoustic signals into sets of distinct auditory streams, corresponding to individual sound sources. The proposed computational framework mimic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012